8 |
An Introduction to Complex Systems: Making Sense of a Changing World
|
|
|
|
In: Faculty Books (2019)
|
|
BASE
|
|
Show details
|
|
9 |
Should we use movie subtitles to study linguistic patterns of conversational speech? A study based on French, English and Taiwan Mandarin
|
|
|
|
In: Third International Symposium on Linguitic Patters of Spontaneous Speech ; https://hal.archives-ouvertes.fr/hal-02385689 ; Third International Symposium on Linguitic Patters of Spontaneous Speech, Nov 2019, Taipei, Taiwan ; http://lpss2019.ling.sinica.edu.tw/ (2019)
|
|
BASE
|
|
Show details
|
|
10 |
Segmentability Differences Between Child-Directed and Adult-Directed Speech: A Systematic Test With an Ecologically Valid Corpus
|
|
|
|
In: EISSN: 2470-2986 ; Open Mind ; https://hal.archives-ouvertes.fr/hal-02274050 ; Open Mind, MIT Press, 2019, 3, pp.13-22. ⟨10.1162/opmi_a_00022⟩ (2019)
|
|
BASE
|
|
Show details
|
|
11 |
A computational account of virtual travelers in the Montagovian generative lexicon
|
|
|
|
In: The Semantics of Dynamic Space in French ; https://hal.archives-ouvertes.fr/hal-02093536 ; Michel Aurnague; Dejan Stosic. The Semantics of Dynamic Space in French, John Benjamins, pp.407-450, 2019, Part IV. Formal and computational aspects of motion-based narrations, 9789027203205. ⟨10.1075/hcp.66.09lef⟩ ; https://benjamins.com/catalog/hcp.66.09lef (2019)
|
|
BASE
|
|
Show details
|
|
12 |
Towards TreeLex++: Syntactico-Semantic Lexical Resource for French
|
|
|
|
In: Language & Technology Conference ; https://hal.archives-ouvertes.fr/hal-02120183 ; Language & Technology Conference, May 2019, Poznan, Poland (2019)
|
|
BASE
|
|
Show details
|
|
13 |
On the integration of linguistic features into statistical and neural machine translation
|
|
|
|
In: Vanmassenhove, Eva Odette Jef orcid:0000-0003-1162-820X (2019) On the integration of linguistic features into statistical and neural machine translation. PhD thesis, Dublin City University. (2019)
|
|
BASE
|
|
Show details
|
|
14 |
Neural machine translation for multimodal interaction
|
|
Dutta Chowdhury, Koel. - : Dublin City University. School of Computing, 2019. : Dublin City University. ADAPT, 2019
|
|
In: Dutta Chowdhury, Koel (2019) Neural machine translation for multimodal interaction. Master of Science thesis, Dublin City University. (2019)
|
|
BASE
|
|
Show details
|
|
15 |
Cross-lingual parsing with polyglot training and multi-treebank learning: a Faroese case study
|
|
|
|
In: Barry, James orcid:0000-0003-3051-585X , Wagner, Joachim orcid:0000-0002-8290-3849 and Foster, Jennifer orcid:0000-0002-7789-4853 (2019) Cross-lingual parsing with polyglot training and multi-treebank learning: a Faroese case study. In: The 2nd Workshop on Deep Learning Approaches for Low-Resource NLP (DeepLo 2019), 3 - 5 Nov 2019, Hong Kong, China. ISBN 978-1-950737-78-9 (2019)
|
|
BASE
|
|
Show details
|
|
16 |
Selecting artificially-generated sentences for fine-tuning neural machine translation
|
|
|
|
In: Poncelas, Alberto orcid:0000-0002-5089-1687 and Way, Andy orcid:0000-0001-5736-5930 (2019) Selecting artificially-generated sentences for fine-tuning neural machine translation. In: 12th International Conference on Natural Language Generation, 29 Oct - 1 Nov 2019, Tokyo, Japan. (2019)
|
|
BASE
|
|
Show details
|
|
17 |
Automatic processing of code-mixed social media content
|
|
Barman, Utsab. - : Dublin City University. School of Computing, 2019. : Dublin City University. ADAPT, 2019
|
|
In: Barman, Utsab (2019) Automatic processing of code-mixed social media content. PhD thesis, Dublin City University. (2019)
|
|
BASE
|
|
Show details
|
|
18 |
Automatic error classification with multiple error labels
|
|
|
|
In: Popović, Maja orcid:0000-0001-8234-8745 and Vilar, David (2019) Automatic error classification with multiple error labels. In: MT Summit XVII, 19 - 23 Aug 2019, Dublin, Ireland. (2019)
|
|
BASE
|
|
Show details
|
|
19 |
Improving transductive data selection algorithms for machine translation
|
|
Poncelas, Alberto. - : Dublin City University. School of Computing, 2019. : Dublin City University. ADAPT, 2019
|
|
In: Poncelas, Alberto orcid:0000-0002-5089-1687 (2019) Improving transductive data selection algorithms for machine translation. PhD thesis, Dublin City University. (2019)
|
|
Abstract:
In this work, we study different ways of improving Machine Translation models by using the subset of training data that is the most relevant to the test set. This is achieved by using Transductive Algoritms (TA) for data selection. In particular, we explore two methods: Infrequent N-gram Recovery (INR) and Feature Decay Algorithms (FDA). Statistical Machine Translation (SMT) models do not always perform better when more data are used for training. Using these techniques to extract the training sentences leads to a better performance of the models for translating a particular test set than using the complete training dataset. Neural Machine Translation (NMT) can outperform SMT models, but they require more data to achieve the best performance. In this thesis, we explore how INR and FDA can also be beneficial to improving NMT models with just a fraction of the available data. On top of that, we propose several improvements for these data-selection methods by exploiting the information on the target side. First, we use the alignment between words in the source and target sides to modify the selection criteria of these methods. Those sentences containing n-grams that are more difficult to translate should be promoted so that more occurrences of these n-grams are selected. Another extension proposed is to select sentences based not on the test set but on an MT-generated approximated translation (so the target-side of the sentences are considered in the selection criteria). Finally, target-language sentences can be translated into the source-language so that INR and FDA have more candidates to select sentences from.
|
|
Keyword:
Computational linguistics; Machine learning; Machine translating
|
|
URL: http://doras.dcu.ie/23726/
|
|
BASE
|
|
Hide details
|
|
20 |
Combining SMT and NMT back-translated data for efficient NMT
|
|
|
|
In: Poncelas, Alberto orcid:0000-0002-5089-1687 , Popović, Maja orcid:0000-0001-8234-8745 , Shterionov, Dimitar orcid:0000-0001-6300-797X , Maillette de Buy Wenniger, Gideon and Way, Andy orcid:0000-0001-5736-5930 (2019) Combining SMT and NMT back-translated data for efficient NMT. In: Recent Advances in Natural Language Processing (RANLP 2019), 2-4 Sept 2019, Varna, Bulgaria. (2019)
|
|
BASE
|
|
Show details
|
|
|
|